To Cluster, or Not to Cluster: How to Answer the Question

Authors

  • Andreas Adolfsson
  • Margareta Ackerman
  • Naomi C. Brownstein
Abstract

Clustering is an essential data mining tool that aims to discover inherent cluster structure in data. For most applications, applying clustering is only appropriate when cluster structure is present. As such, the study of clusterability, which evaluates whether data possesses such structure, is an integral part of cluster analysis. However, methods for evaluating clusterability vary radically, making it challenging to select a suitable measure. In this paper, we perform an extensive comparison of measures of clusterability and provide guidelines that clustering users can utilize to select suitable measures for their applications.

ACM Reference format: Andreas Adolfsson, Margareta Ackerman*, and Naomi C. Brownstein*. 2016. To Cluster, or Not to Cluster: How to Answer the Question. In Proceedings of Knowledge Discovery from Data, Halifax, Nova Scotia, Canada, August 13–17 (TKDD '17), 9 pages. DOI: 10.1145/nnnnnnn.nnnnnnn

Similar resources

CLUSTER ALGEBRAS AND CLUSTER CATEGORIES

These are notes from introductory survey lectures given at the Institute for Studies in Theoretical Physics and Mathematics (IPM), Teheran, in 2008 and 2010. We present the definition and the fundamental properties of Fomin-Zelevinsky’s cluster algebras. Then, we introduce quiver representations and show how they can be used to construct cluster variables, which are the canonical generator...

Who Should be Interviewed? A Response from Cluster Analysis

Objective: This article presents an application of cluster analysis for social science research, especially studies that include interviews as part of their data collection. This application is more suitable for sequential mixed-methods researchers who use quantitative data to frame subsequent qualitative subsamples for conducting interviews. Methods: In more detail, the algorithm (i....

A New Method for Clustering Wireless Sensor Networks to Improve the Energy Consumption

Clustering is an effective approach for managing nodes in a Wireless Sensor Network (WSN). This paper proposes a new clustering method for WSNs that uses the Binary Gravitational Search Algorithm (BGSA) to improve the energy consumption of the sensor nodes. The objective of this paper is to reduce the energy consumption of sensors in WSNs by selecting the sub optimum se...

Seismic Data Forecasting: A Sequence Prediction or a Sequence Recognition Task

In this paper, we attempt to predict earthquake events in a cluster of seismic data on the Pacific Ring of Fire using multivariate adaptive regression splines (MARS). The model is employed either as a predictor for a sequence prediction task or as a binary classifier for a sequence recognition problem, which could alternatively help to predict an event. Here, we explain that sequence prediction/r...

Use of key indicators to monitor sustainable development of rural areas

This study provides a multidimensional analysis of sustainable socio-economic development and its challenges in the rural areas of Ukraine. A methodology for implementing the conceptual provisions of sustainable development was created. The advantages of using indicative assessment at the regional level were justified. The methodical approach to defining the indicators of sustainable developmen...

Publication date: 2017